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ABSTRACT 



We apply the method of Burnett & Binney (2010) for the determination of stellar distances and parameters to the internal catalogue 
of the Radial Velocity Experiment (Steinmetz et al. 2006). Subsamples of stars that either have Hipparcos parallaxes or belong to 
well-studied clusters, inspire confidence in the formal errors. Distances to dwarfs cooler than ~ 6000 K appear to be unbiased, but 
those to hotter dwarfs tend to be too small by ~ 10% of the formal errors. Distances to giants tend to be too large by about the same 
amount. The median distance error in the whole sample of 216 000 stars is 28% and the error distribution is similar for both giants and 
dwarfs. Roughly half the stars in the RAVE survey are giants. The giant fraction is largest at low latitudes and in directions towards the 
Galactic Centre. Near the plane the metallicity distribution is remarkably narrow and centred on [M/H] = -0.04 dex; with increasing 
\z\ it broadens out and its median moves to [M/H] =s -0.5. Mean age as a function of distance from the Galactic centre and distance \z\ 
from the Galactic plane shows the anticipated increase in mean age with \z\. 



Key words. Stars: distances, Stars: fundamental parameters, Galaxy: abundances, Galaxy: disk, Galaxy: stellar content, Galaxy: 
structure 

1 . Introduction going, and has provided a number of important results regard- 
ing the_structrrre_and kinematics of the Galaxy (see for exam- 

In recent years there have been significant advances in our pie Smith et al. 2007, Seabroke e t"al.ll2008l i Munari et al.ll2009l 

knowledge of the Galaxy due to a number of large-scale sur- [Siebert et al .l l2008l ISiebert et al.l l2010l). 

veys i n complementa ry magnitude ranges. The Hipparcos mis- One of the challenges thrown up by recent surveys, con- 

sion (Pe rrvmanl Il997l) obtained astrometry of unprecedented centrating as they do on relatively distant stars, is that of dis- 

precision for a sample of bright stars that was complete only tance estimation. The astrometric precision required to mea- 

down to V ~ 8, and at the other end of the scale we have sure trigonometric parallaxes to these stars is not yet avail- 

the Sloan Digital Sky Survey (SDSS, lYork et all |2000), go- able, so we must rely on secondary methods for determining 

ing no brighter than around ma gnitude r = 1 4 and signif- stellar distances. A u seful step in this direction was taken by 

icantly fainter than r — 18 (tlvezic et al.1 1200 lh . The sizable iBreddels et al.l (I2010J) . who showed that RAVE data could be 

gap between these t wo studies is filled by the Radial Velocity used to estimate the parameters and thus distances for objects 

Experiment (RAVE, ISteinmetz et al.l 120061) . focusing primarily in the second RAVE data release. Their technique was honed by 

on magnitudes in the range 9 < / < 13. RAVE is still on- IZ witter et al.l (12010). who used spectrophotometric distances to 



Burnett et al.: Distances to RAVE stars 



characteris e the RAVE survey . An al ternative approach was de- 
veloped by Burnett & B innevl (1201 Ol) in which our prior knowl- 
edge of the Galaxy and stars is more systematically exploited. In 
this paper we apply this technique to ~ 216 000 stars observed 
by RAVE. 

We determine spectrophotometric distances in parallel with 
other stellar parameters, in particular metallicity, age and mass, 
so we are able to study how the giant/dwarf ratio and the dis- 
tributions in age and metallicity vary in the region surveyed by 
RAVE. Previous applications of Bayesian inference to the deter- 
mination of ste llar parameters have focused on single parameters 
such as age ( J 0rgensen & Lindegr en 2005); to our knowledge, 
this is the first study to consider all parameters together. 

In Section [2] we specify the data on which our distances are 
based, and in Section[3]we summarise the algorithm used to cal- 
culate distances. Section [4] tests the derived distances by com- 
paring them with (i) Hipparcos parallaxes and (ii) established 
cluster distances. The comparison with Hipparcos parallaxes un- 
covers slight biases in the distances to hot dwarfs and giants, 
respectively. Section [4] tests the validity of the formal errors by 
(i) comparing distances to the same stars derived from indepen- 
dent spec tra, and (ii) c omparing our distances with those ob- 
tained by Zwit ter et al.l (2010). The algorithm determines prob- 
ability distributions for several stellar parameters in addition to 
distance. In Section [5] we display the distribution of formal er- 
rors in metallicity, age and initial mass, in addition to those in 
distance. In Section[6]we use our distances to explore which re- 
gions of the Galaxy are probed by the RAVE survey, and indicate 
which regions are predominantly probed with dwarfs or giants. 
In Section [7] whether examine how how mean age and metallic- 
ity vary within the probed portion of the Galaxy. Section[8]sums 
up. 

2. The input data 

Since it first took data in 2003 March, the RAVE survey has 
taken in excess of 400 000 spectra with resolution R =* 7500. 
A sophisticated reduction pipeline is required to recover stellar 
parameters such as T e g , log g and [M/H] from this huge dataset. 
The data pipeline involves several parameters whose values have 
to be optimised. Unfortunately, the values that are optimum for 
one purpose are not optimum for another. In particular, we will 
see in 34. 1 1 that the settings that are optimised for dwarfs are 
sub-optimal for giants, and vice versa, so in this work we use 
distinct implementations of the pipeline for stars which have 
logg greater than, or less than 3.5, with the split based on the 
value of log g returned by the version of the pipeline that is used 
for the high-gravity stars. This is the "VDR2" version of the 
pipeline, which was used for RAVE's 2nd data release (Zwitter et 
al. 2008) and also to produce a much larger data set that was re- 
leased to the collaboration in 2010 January. The low-gravity stars 
are processed with the "VDR3" version of the pipelines, which 
was used for RAVE's 3rd data release (Siebert et al. 2011) and 
also to produce data released to the collaboration in 2010 July. 

We consider only data for stars that have spectra with signal- 
to-noise ratio S IN > 20, 2MASS photometry with quoted error 
in. J — K less than 0.3, and values from RAVE for T e s, logg, 
[M/H] raw , and [a/Fe]. These criteria yield data for 216 064 dis- 
tinct objects. We have obtained distances and stellar parameters 
for these stars. 

We neglect the effects of extinction by dust because (a) 
we use near-infrared data, (b) most of the sample lies at quite 
high Galactic latitude, and (c) we do not have a straightforward 
method of estimating the reddening of individual stars. 



The metallicities given in the RAVE data releases can be 
refined using calibration coefficients determined by comparing 
the raw RAVE metallicities with those obtained from high- 
resolution spectroscopy of a subset of stars. Each version of 
the pipeline comes with a calibrations: for VDR2 we have 
dZwitter et al.ll2008l) 

[M/H] = 0.938[M/H] raw + 0.767[a/Fe] - 0.064 logg + 0.404, 

while for VDR3 we have dSiebert et al.ll201ll) 



[M/H] = 1.094[M/H] raw + 1.21 [a/Fe] -0.711 ■ 



T e S 



+ 0.763 



5040 K 

In the rest of this paper, all input values of [M/H] are those ob- 
tained by using the appropriate calibration formula above. 

The observational errors of each star depe nd on the star's 
signal-to-noise ratio (S /N). From Fig. 19 o flZ witter etaH d2008l) 
we take the errors on temperature, gravity and metallicity at 
S/JV = 40tobe 

cr log7 = 0.0434, (1) 

_ J 0.5, ifr eff <8 000K; 

°"iogg - | o.25 +0.436 log (g^jf), otherwise; (2) 

o"[M/H] = 1.071og7/ eff -3.71. (3) 

In the great majority of cases these errors are sig nificantly 
larger than those given in Table 4 of ISiebert et alj d201lh be- 
cause the latter are simply internal errors obtained from re- 
peat obser vati ons of the same star. As will become apparent 
in Section 14.11 there is a substantial component of system- 
atic error in the stellar parameters, arising from the way the 
observed spectra hav e been fitted to stellar templates. The 
results of Section I4TT1 show that the errors estimates we use 
are broadly correct. 

Four entries in Table 4 of Siebert et al. (2011) show er- 
rors in log g that are more than 25 percent larger than those 
we have assumed, all for values of log g - 2 or 2.5. In three 
of these cases the associated error in T eS is slightly larger 
than the value we have assumed. If future versions of the 
pipeline clearly show significant increases in the errors at in- 
termediate gravities, the errors used in the distance-finding 
algorithm should be modified accordingly. 

We obtain the errors at other values S /N + 40 from the 
empiri cal scaling given by eqs. (22) and (23) of IZ witter et al.l 
(2008). Errors on colour and magnitude were taken from the 
2MASS measurements on a star-by-star basis. No allowance is 
made for the error in [a/Fe]. 

W e take J and K ma gnitudes from the 2MASS cata- 
logue (ISkrutskie et al.ll2006l) . and we use the Padova isochrones 
(Bert elliet alj 120081) - these isochrones are the only widely 
available ones tha t reproduce the red clump effectively 
(IZwitteretal.ll2010h . 

3. Theory 

We begin by brie fly re capping the formalism developed in 
Burnett & Bin nevl (l2010h . where a method for estimating the 
value of, and error on, each star's d i stance , metallicity, age and 
mass was presented - Pont & Ever (2004) give a useful intro- 
duction to the general methodology. We take the relevant ob- 
servables for each RAVE star to be the logarithm of effective 
temperature T e ff and surface gravity g, the calibrated metallic- 
ity [M/H], colour J — K and apparent /-magnitude. We combine 
these into the vector of observables 



y = (log r eff , log*, [M/H], J-KJ). 



(4) 
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Each star is assumed to be characterised by a set of 'intrinsic' 
parameters: true metallicity [M/H] t , age r, initial mass M and 
heliocentric distance s, which together form a second vector 



x = ([M/H]„logT,A-U). 



(5) 



We assume Gaussian observational errors on each component of 
y, thus the measured values y for a star have probability density 
function (pdf) 



P(y I y(x), er v ) = G(y, y(x), <r v ), 



(6) 



where y(x) represents the true observables corresponding to in- 
trinsic stellar values x, and for an n-tuple w G is defined to be 
the multivariate Gaussian 



G(w,//, o") = f] (a-i V2^~' exp(-(w,- - Hiflrf) ■ 



(7) 



The pdf of a star's intrinsic parameters x is then conditional upon 
its observed values y, the observational errors <t v and the fact S 
that the star is observed in the survey in ques tion. We thus seek 
the po sterior pdf p(x | y, cr y ,S). It is shown in lBurnett & Binnevl 
(2010) that the moments of this distribution are given by 



In 



Jd 4 xx*0(x)G(y,y(x),o-,)p(x), 



(8) 



where <p(x) describes any part of the survey selection function 
that cannot be expressed as a function of y. This leads to a value 
for the expectation of each stellar parameter through 

C*> = £, 

and an uncertainty defined by 



(9) 



n = V 1 



cr i =^j(Ia/Iio)-<x i ) 2 . (10) 

Th e data were analysed using the prior of lBurne tt & Binney 
(2010), namely a three-component Milky Way model of the form 



P (x) = p(M) J] P i([M/H\) Pi(T) Pt(r), 



(11) 



i=i 



where i = 1, 2, 3 correspond to a thin disc, thick disc and stellar 
halo respectively. We assumed the same i nitial mass function 
(IMF ) for all three components following iRroupa et alj (1993) 
and lAumer & Binnevl ([2009), namely 



P(M) ■ 



M-1.3 

0.536 Mr 22 
. 0.536 AT 2 - 519 



if/K<0.5M G , 
ifO.5M < M< 1M , 
otherwise. 



(12) 



The star-formation rate within the thin disc is assumed 
to have d eclined exponentially with time constant 8.4 Gyr 
(lAum er & Binnev 2009). The ages of halo and thick-disc stars 
are uncertain. We merely assume that the ages of all halo stars 
exceed 10 Gyr, while those of thick-disc stars exceed 8 Gyr. 
Since the thick disc is generally thought to be younger than most 
of the halo, we further assume that no thick-disc star has an age 
in excess of 12 Gyr. The three components are therefore: 



Thin disc (i = 1): 

Pi([M/H]) = G([M/H], 0,0.2), 

Pi (t) oc exp(0. 1 19 r/Gyr) for t < 10 Gyr, 



(13) 



pi(r) oc exp 



R 

othin 



Table 1. Values of disc parameters used. 



Parameter 


Value (pc) 


pthin 


2 600 


-thin 


300 


Dthick 


3 600 


-thick 


900 



Thick disc (i = 2): 

p 2 ([M/H]) = G([M/H],-0.6,0.5), 

piif) °c uniform in range 8 < r < 12 Gyr, 



(14) 



p 2 (r) oc exp 



R 



Dthick 

K d 



_thick 



Halo {i = 3): 

p 3 ([M/H]) = G([M/H], -1.6,0.5), 

Pj,(t) oc uniform in range 10 < t < 13.7 Gyr, 



(15) 



p 3 (r) oc f 



-3.39 



Here R signifies Galactocentric cylindrical radius, z cylindrical 
height and r is spherical radius. The parameter values are given 
in Table [Q 



4. Tests 

In this section we show the results of a number of tests we per- 
formed to check the reliability and consistency of our stellar dis- 
tances. 



4.1. Hipparcos stars 

iBurnett & Binnevl (I2010I) demonstrated the robustness of the 
technique described in Section [3] on the Geneva-Copenhagen 
sample (Holmberg et al. 2009). However it is clearly impor- 
tant also to make sure that it functions correctly on the RAVE 
data. Consequently, we now investigate the performance of our 
method on the subset of RAV E stars that are in the Hipparcos 
catalogue, in its re-reduction by Ivan Leeuwenl (I2007I) . 

We identify RAVE stars that are in the Hipparcos Catalogue 
by requiring that sky positions (after updating the Hipparcos 
positions to J2000) coincide to 1 .5 arcsec, and proper motions 
coincide to 2cr, where <x is the quadrature-sum of the errors in 
the proper motions in the RAVE and Hipparcos catalogues. This 
process leads to 4582 matches, but these matches include only 
4080 distinct stars; the remaining matches arise from multiple 
RAVE observations of the same object. 

Given that the Bayesian method can o utput parallaxes as 
easily as distances, Burnett & Binnevl (1201 Ol) argued for com- 
parisons with the Hipparcos data to be performed in parallax 
space. This is what we do in this section. It proves instructive 
to compare the stars in three groups: "giants" (iogg < 3.5), 
"cool dwarfs" (\ogg > 3.5 and T e ff < 6000 K) and "hot dwarfs" 
(logg > 3.5 and T^ > 6000 K). Fig. [TJ shows the normalised 
residuals 
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Fig. 1. Upper panels: distribution of normalized residuals (fT6l l between our spectrophotometric parallaxes and values from 
Hipparcos. In this and subsequent histograms the quantity plotted vertically is the number of objects in the bin divided by the 
bin's width, and an error bar shows the statistical uncertainty of each point. The mean and dispersion of each distribution are given 
at top right. A Gaussian of zero mean and unit dispersion is over-plotted. Lower panels: the distribution of fractional uncertainties 
in the spectrophotometric parallaxes. For the top panel the errors are found by adding our uncertainty and that of Hipparcos in 
quadrature; for the bottom panel the errors are just the spectrophotometric ones. 



between the photometric and Hipparcos parallaxes for each of 
these groups. In each case the dispersion of the residuals is close 
to unity, which confirms the accuracy of the derived errors. For 
the cool dwarfs the mean residual is pleasingly close to zero, but 
the mean residual of the hot dwarfs is distinctly positive, indi- 
cating that the photometric parallaxes tend to be larger than the 
trigonometric ones. Fig.|2]makes the cause of this anomaly clear 
by showing the density of RAVE stars with S /N > 20 in the 
(Jeff, log g) plane together with isochrones for ages 3 Gyr (full) 
and 9 Gyr (broken) and metallicities ranging from [M/H] = - 1 
(red) to [M/H] = 0.1 (dark blue). For T eS > 6000 K and 
[M/H] > -0.4 there is a clear tendency for stars to be concen- 
trated at higher values of log g than the isochrones allow. This 
shortcoming of the data fed to the Bayesian algorithm leads to 
the systematic under-estimation of the radii and therefore the lu- 
minosities of stars. Consequently, the predicted parallaxes are 
larger than they should be. Fig. [2] shows that the displacement 
of hot, relatively metal-rich stars to excessive values of log g is 
significantly more pronounced with the VDR3 pipeline (lower 
row) than with the VDR2 pipeline, and the stellar parameters 
from VDR3 yield a mean value of % w for the hot dwarfs which 
is as large as 0.273. 

The top right panel of Fig. Q] shows that the spectropho- 
tometric parallaxes of giants are systematically too small, so 
fl m = -0. 1 1 . When the parameters output by the VDR2 pipeline 
are used for giants, we obtain f{ m - -0.19. In fact, the VDR3 
pipeline returns values of log g that are systematically larger than 
those returned by the VDR2 pipeline. For dwarfs the increases 
are relatively large and for many stars the VDR3 values are phys- 
ically implausible. For giants the increases are smaller and seem 
to be beneficial. 

The mean and dispersion displayed in Fig. Q] are for the dis- 
tributions once we clip outliers, defined as stars with normalized 
residuals of modulus greater than four. This excludes eight stars 



of the original 4080. Table [2] lists some data for these objects, 
one of which has repeat observations. It is notable that stars with 
log g < 4 are predicted to have smaller parallaxes than Hipparcos 
measured. For example, the logg value of Hipparcos 6075 is 
strongly indicative of a giant, so the stellar models predict for 
it an absolute /-magnitude in the range Mj € (-5.7,0.5); com- 
bined with its measured J — 8.68 this would imply a parallax 
in the range m € (0.2, 4.2) mas. Consequently the assignment 
of (w) e (0.42, 1.04) mas is eminently reasonable but in strong 
conflict with the Hipparcos value, 28.5 mas. The Hipparcos mag- 
nitude of this star is three magnitudes fainter than its value 
of / from 2MASS. Either the star is exceptionally red, or the 
Hipparcos and 2MASS data relate to different objects. 

Two of the stars with small \ogg and under-estimated 
parallax (Hipparcos 44216 and 46831) lie quite near 
the Galactic plane and may well be obscured, which 
would cause the predicted parallax to be too small. 
At www.rssd.esa.int/SA/HIPPARCOS/docs/voll l_all.pdf 

Hipparcos 44216 is listed as a periodic variable star with 
amplitude 0.18 mag. 

Hipparcos 65142 has J - K - 0.799, which is consistent 
with its being either a giant or a dwarf, depending on its metal- 
licity. The RAVE catalogue gives [M/H] raw = 0.2 and at this high 
metallicity a dwarf is consistent with the colour. The algorithm 
chooses this solution because log g is measured to be 4.4. On ac- 
count of the prior, the algorithm returns a probability distribution 
for [M/H] t , 0.04 + 0.14, that is centred on a smaller metallicity. If 
[M/H] t were near the bottom of this range the star could not be a 
dwarf and the predicted parallax would fall to near the Hipparcos 
value. 

The other outliers in Table [2] with log g > 4 all have T e g > 
7000 K, and with one exception have over-estimated parallaxes. 
Like Hipparcos 65142, these stars have probably been assigned 
values of log g that are too large and are in consequence pre- 
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Fig. 2. The density of RAVE stars with S/N > 20 in the (r e ff, log g) plane together with Padova isochrones for ages 3 Gyr (solid) 
and 9 Gyr (dashed). The colours indicate calibrated metallicities: [M/H] = — 1 red; [M/H] = -0.6 yellow; [M/H] = -0.2 green; 
[M/H] = light blue; [M/H] =0.1 dark blue. The top row shows parameters from the VDR2 pipeline, while the lower row is for 
the VDR3 pipeline. In each row stars are grouped by metallicity, with the highest metallicities on the right. Note that in several 
panels, in the temperature range 6000 < T e ff < 7000 the density of stars is high below the lowest isochrone. This anomaly is most 
pronounced in the lower row. 

Table 2. Outliers from the analysis of Hipparcos stars. K^ symbolises normalized residuals; parallaxes are measured in mas. 
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-1.17 


51 


97962 


13.64 ± 1.71 


0.54 ±0.12 


12.6 


-25.4 


10.67 


10.14 


18563 


4.99 


-0.59 


30 


101250 


1.17 ±0.85 


12.44 ±2.18 


16.0 


-33.0 


7.69 


8.40 


8130 


4.26 


0.63 
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dieted to be less luminous than they really are. T he one star that 
has un der-estimated parallax is Hipparcos 97962. lTsvetkov et al.l 
(2008) have already noted that the spectroscopic distance to this 
star exceeds the distance inferred from its parallax by a factor 
~ 37. They conclude that it is a B9V star and a member of a 
multiple star system. Both of its RAVE spectra imply that it is 
a very hot star. The highest temperature provided by the model 
isochrones is 30 410 K, so even with allowance for observational 
error, the temperature from the higher S/N observation cannot be 
matched by the program. Moreover, the star's metallicity, -1.17, 
falls below that of our lowest-metallicity isochrone. Clearly we 
should exclude from analysis all stars that are so incompatible 
with the models. In practice we excluded all observations with 
r e ff above the maximum model temperature (r e ff = 33 600 K). 
Fortunately, this cut removed only two stars from the sample. 



4.2. Cluster stars 

One other test we can perform inv olves finding t he dis tances to 
RAVE stars that were identified bv lZwitter et al.l (2010) as lying 
in clusters with known distances. Ten of these stars are giants 
(logg < 3), so provide a good test bed for the analysis of such 
stars, which have traditionally p roved difficult for spe ctrophoto- 
metric distance techniques to fit dBreddels et al.l20 10). The large 
distances to these stars also provide a good test of the safety 
of neglecting reddening. Star OCL00277_2236411 has a repeat 
measurement in the RAVE catalogue and hence we fitted it twice 
using both sets of results. 

The results of our fitting of these fifteen stars are shown in 
Tableland Fig. [3] All but one of the stars lie within 1 <x of their 
literature distances, implying a reasonable fit to both dwarfs and 
giants, with no evident bias. The mean value of the normalised 
residual H s — (s - s c )/cr s is 0.28, where cr s is the formal error in 
our distance s. If our distances were unbiased and the literature 
distances s c were exact, the distribution of Us would have zero 
mean and a dispersion of unity, and the dispersion will in prac- 
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Table 3. Expectation values (s) of the distances to cluster mem- 
bers a nd their uncertainties both in this paper and in lZwitter et al.l 
d201Ch . 



Star ID 



Cluster 
distance (pc) 



this paper 
(s) ± a s (pc) 



Z10 
(s) + cr s (pc) 



OCL00148.1373319 


770 


1401 ± 502 


1120 ±230 


OCL00147.1373471 


490 


653 ± 250 


620 ± 100 


OCL00277.2236411 


490 


629 ± 208 


540 ± 50 


OCL00277.2236411 


490 


729 ± 267 




OCL00277.2236511 


490 


459 ± 169 


450 ± 60 


T7751_00502.1 


938 


994 ± 247 


1040 ± 150 


J000324.3-294849 


270 


238 ± 45 


240 ± 20 


J000 128.6-30 1221 


270 


289 ± 70 


240 ± 20 


J125905.2-705454 


5900 


5908 ± 881 


5500 ± 800 


M67-0105 


910 


788 ± 241 




M67-0135 


910 


1126 ±290 


1130 ±290 


M67-0223 


910 


867 ± 256 


890 ± 140 


M67-2152 


910 


912 ±194 


960 ± 150 


M67-6515 


910 


843 ± 300 


990 ± 190 


J075242.7-382906 


1300 


1896 ±611 


1290 ±220 


J075214.8-383848 


1300 


1588 ±583 


1740 ± 330 



10 



1000 - 



100 



100 




1000 
cluster s/pc 



Fig. 3. Estimated distances (s) to cluster stars plotted against 
cluster distances from the literature. Dwarf stars are marked by 
open octagons and giant stars are marked by filled squares. The 
dotted lines show equal distances and distances that differ by 
30%. Stars in the same cluster have been slightly offset horizon- 
tally for clarity. 



tice be greater than unity because the literature distances will 
contain errors. Hence we expect the mean value of 9i s to dif- 
fer from zero by in excess of 1/V15 in addition to any bias in 
our distances. Hence the actual mean, CR S ) - 0.28 is consistent 
both with no bias and the level of bias revealed by the Hipparcos 
sample. The best-fit zero-intercept straight-line fit to the points 
in Fig.[3]has a gradient of 1.03, again consistent with no bias. 

We conclude that in contrast to the finding of Breddels et al., 
our results for giants appear to be as reliable as those for dwarfs. 
Furthermore the lack of a demonstrable bias for these more dis- 
tant stars implies that our distances are valid despite our neglect 
of reddening, which is justified a priori by the use o f infrared 
magnitudes and the arguments of IZ witter et al.l (12010). 

4.3. Repeat observations 

A significant number of RAVE stars have been observed more 
than once. Although these repeat observations will not reveal 



systematic errors, they do provide a valuable test of the quoted 
errors. 

Fig- El shows the outcome of this test, using 45475 spectra 
of 19 094 distinct stars. The top panel shows the distributions of 
distance discrepancies divided by the mean distance for giants 
and dwarfs. For the giants the median fractional distance residual 
is 9.2%, while for the dwarfs it is only 7.3%. Taking both popu- 
lations together, we find that 68.2% of the points lie at a scatter 
of below 13.3%, implying that this may be a more realistic es- 
timate of the average distance error than the value implied by 
the formal errors, 28% (top left panel Fig.|7]below). The lower 
panel of Fig. [4] shows the distributions of normalised residuals, 
where the normalising factor is the quadrature-sum of the formal 
errors on each distance. For both the giants and the dwarfs, the 
dispersions of these distributions are considerably smaller than 
unity, again implying that the formal errors are excessive, un- 
doubtedly because they are based on the conserva tive estimates 
of the errors on the stellar parameters derived in IZ witter et al.l 
J2008h . 

Of course repeat observations will not reveal systematic er- 
rors arising from shortcomings in the isochrones and spectral 
templates, for example. However, the work with Hipparcos and 
cluster stars described in Sections r4.1l and l4~2l stronglv limits the 
scale of systematic errors, which would not be detected by Fig. [4] 
Consequently, we conclude that the mean error in our distances 
is ^ 20%, which is a good level of accuracy for a spectrophoto- 
metric technique. 

In what follows, we use for stars with several spectra the 
weighted averages of parameters from individual spectra, with 
the weights taken to be the S /N ratios of the spectra. This leaves 
us with data for 209 950 distinct stars. 



4.4. Consistency of distances to subgiants 

Our use of distinct pipelines to assign parameters to stars 
with logg < 3.5 and logg > 3.5 raises the question of 
whether distances are assigned consistently to stars that have 
logg a 3.5, which may lie on one side of the dividing line 
rather than the other merely by virtue of noise. Fig. \5\ ad- 
dresses this concern by comparing the VDR2 and VDR3 dis- 
tances for the 15 525 stars with logg in (3.3,3.7). It shows 
the distribution of svdr2 - svdr3 normalised by the quadra- 
ture sum of the formal errors in each distance. The mean of 
the distribution is extremely small (-9 x 10 5 ) although the 
mode of the distribution lies near 0. 1 . It is to be expected that 
the higher values of log g returned by VDR3 should yield the 
smaller distances so we expect the peak in Fig. \E\ to lie at a 
positive value of the ordinate. In view of the small value of 
the mean of the distribution, using one pipeline rather than 
the other for stars in the transition region cannot be said sys- 
tematically to bias the derived distance. 

4.5. Comparison with \Zwitter et al\ \201(\) 

It is interesting to co mpare our dis t ances with those derived from 
the same spectra bv lZwitter et al.l d2010l) . which gives distances 
obtained from the VDR3 pipeline with three different isochrone 
sets. Here we consider only the distances Zwitter et al. obtained 
from Padova isochrones. We use the "new, revised" distances de- 
scribed in the note added to Zwitter et a l. (2010) in proof; these 
distances u se the less co n serva tive error estimates to which the 
analysis of ISiebert et al.l d201ll) gives rise. Our distances con- 
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Fig. 4. Top: the distribution of fractional differences in distances 
from repeat observations. Bottom: the distribution of distance 
differences divided by the quadrature-sum of the formal errors 
of each distance. As in Fig. \T\ the quantity plotted vertically is 
the number of stars in each bin divided by the bin's width, and 
the vertical bars show the statistical uncertainty. 
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Fig. 5. The distribution for 15 525 stars with 3.3 < log g < 3.7 of 
the differences 5vdr2 - *vdr3 normalised by the quadrature sum 
of the formal errors. 



tinue to be base d on th e older, more conservative error estimates 
of lZwitteret"ai1 (120081) . 

The final column of Table [3] shows the distances Zwitter 
et al. find for the cluster stars. The formal errors on these dis- 
tances are smaller than ours on account of the less conserva- 
tive errors that Zwitter et al. adopted for the input parame- 
ters. Otherwise the agreement with our distances is excellent. 
In the case of OCL00148J373319, for which the difference be- 
tween our distance and the literature value is largest (1 .25<x), the 
Zwitter et al. distance lies closer to the literature value than our 
distance does, although it is still larger than expected by 1.5<x. 
This star lies at Galactic latitude b - 3.6°, while most RAVE 



stars lie at much higher latitudes; only 5622 of the spectra un- 
der consideration come from \b\ < 6°. The distances derived for 
OCL00148.1373319 from RAVE data may be too large because 
the star is significantly obscured and we have neglected obscu- 
ration. 

The upper panel of Fig. |6]shows the distributions of the nor- 
malized residuals between our distances and those of Zwitter at 
al. when all stars are taken together. The mean normalised resid- 
ual is pleasingly small, 0.02. The lower panel shows the distri- 
bution of normalised residuals for giants and dwarfs taken sep- 
arately. The dwarfs have quite a narrow distribution of residuals 
(dispersion 0.52) but they are offset from zero by 0.14, imply- 
ing that the Zwitter et al. distances to dwarfs are systematically 
smaller than ours. We saw in 34.11 a tendency for our parallaxes 
for hot dwarfs to be larger than those measured by Hipparcos, 
so for these stars our distances are already too small. Thus the 
Zwitter et al. distances are offset from ours in the opposite di- 
rection from what one would expect if they were more accurate 
than ours. This result undoubtedly reflects the fact that all the 
Zwitter et al. distances are based on the VDR3 pipeline, which 
as Figure|2]attests, over-estimates logg for hot dwarfs. 

At 0.52 the dispersion of the normalised residuals of the 
dwarfs in Fig. [6] is substantially smaller than unity but larger 
than the dispersion of repeat observations of dwarfs (Fig. |3J, 
This situation is what one would expect given that the Zwitter 
et al. distances and ours derive from the same data but processed 
with different versions of both the reduction pipeline and the al- 
gorithm used to extract distances from stellar parameters. 

In Fig.|6]the distribution of normalised residuals between the 
Zwitter et al. distances for giants and ours is quite wide (disper- 
sion 0.86) and offset in the opposite direction to the distribution 
of dwarfs: the Zwitter et al. distances for giants tend to be larger 
than ours. Since the top right panel of Fig. Q] implies that our 
distances for giants are already larger than the Hipparcos paral- 
laxes imply, the implication is that the Zwitter et al. distances for 
giants are less accurate than ours. 

Whereas in Fig. [6] there is a contribution to the distribution 
of normalised residuals for dwarfs from the different pipelines 
used (VDR2 versus VDR3), there is no such contribution to the 
broader distribution of residuals for giants, and the entire width 
of the giant distribution derives from the different algorithms 
used to extract distances from a given set of stellar parameters. 

In fact the residual distribution can be und erstood in terms 
of of the different priors used here and in Zwitte r et all whereas 
our prior recognises the existence of three components in the 
Galaxy. IZ witter et akl used a simple prior involving an IMF and a 
magnitude-dependent effect to represent the probability of a star 
entering a magnitude-limited sample under the assumption of 
constant volume density. First, our prior incorporates the spatial 
inhomogeneity of the Galaxy's discs, which pulls stars towards 
smaller distances, particularly at high Galactic latitudes. The ef- 
fect on dwarf stars can be expected to be rather small, but the 
effect on giants is more marked: in order to fall into the survey's 
magnitude limits (which include both low- and high-magnitude 
cuts), a giant would have to be at a reasonable distance from 
the Sun, at which point (due to RAVE's range of Galactic lati- 
tudes) the disc's structure begins to play a significant role. This 
accounts for the leftwards wing of the red histogram in the bot- 
tom panel of Fig. [6] 

Second, we also have a prior on stellar age that favours older 
ages. This prior increases the likely luminosity of a star of given 
initial m ass over the m ost probable luminosity derived from the 
prior of IZ witter et all Conseque ntly, we ide ntify a significant 
number of stars as giants that IZwitter et al] considered to be 
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Fig. 7. Distributions of output errors for all four stellar parameters: distance s (top left); metallicity (top right); age (lower left); mass 
(lower right). 



dwarfs or subgiants. These stars appear as a noticeable positive 
wing in the red histogram of Fig. [6] 

Aside from these substantial differences in approach, the dif- 
ferent IMF and metallicity distributions taken in our study and 
that of IZwitter et alj make the small remaining spread seen in 
Fig.|6]eminently reasonable. 



5. Output precisions 

Fig- E] shows the distribution of formal errors in each stellar pa- 
rameter for the whole RAVE sam ple. Comparing the s e dis tribu- 
tions with those in figs. 3 and 10 of Burnett & Binney (2010), we 
see that our precisions are noticeably higher than forecast. The 
median formal error in distance is 28%, and in jj4.3l we saw that 
repeat observations suggested that the random errors are likely 
only half as large. The median input error in metallicity from the 
catalogue is 0.237, while the top right panel of Fig. |7]shows that 
the median output error is 0.17, so use of the prior diminishes the 
uncertainty by 29 %. This reduction is cons istent with the small 
scatter in Fig. 1 of lBurnett & Binnevl(l2010l) . 

6. The selection function 

Even though RAVE has one of the simplest and best-defined se- 
lection criteria of any large survey of the Galaxy, selection is 
based on colour and magnitude on the sky, and it is a non-trivial 
exercise to compute the resulting fraction of the stars in a given 
location in space that will be in the catalogue. Until these frac- 
tions are better determined, we cannot infer spatial densities of 
stars in the Galaxy from counts of stars in the RAVE catalogue. 
Hence at this stage we are restricted to three lines of enquiry: 
(i) what stars does the survey capture?, (ii) how do the distribu- 
tions of stellar parameters vary with position in the Galaxy?, and 



(iii) how do the Galaxy's kinematics vary with position, age and 
metallicity? In this section we focus on the first two lines of en- 
quiry. Our distances and other parameters will be used elsewhere 
to study the Galaxy's kinematics. 

In Fig. [8] we show the distribution of absolute /-magnitudes 
in different slices of Galactic latitude. The distributions are 
strongly bimodal: the red clump produces a narrow peak at 
(Mj) ~ -1 and turnoff stars a broader peak around (My) ~ 3. 
At \b\ < 25° clump stars completely dominate, while the turnoff 
stars outnumber giants above \b\ ~ 40°. This progression reflects 
the steepening gradient in the density of stars along the line of 
sight with increasing \b\ since clump stars must have a distance 
modulus of more than 8, and a distance ^ 400 pc, to enter the 
survey, and towards the pole there are many fewer stars at such 
distances than nearby dwarfs. In the absence of a steep gradient 
along the line of sight, clump stars dominate the survey because 
the latter was designed to select disc giants. Moreover the frac- 
tion of giants at \b\ < 25° is enhanced by the fact that during 
most of the survey a colour cut J - K > 0.5 has been imposed 
in the region 230° < / < 315°, \b\ < 25° precisely in order to 
favour the selection of giants - elsewhere selection has been on 
magnitude alone. 

Fig. [9] shows how the distribution in absolute magnitude 
varies with Galactic longitude. As the argument just given leads 
one to expect, giants are less prominent towards the anticentre 
than towards either the inner Galaxy or the tangent directions. 
However, the distributions in I are strongly influenced by the 
survey's non-uniform sky coverage. Towards the Galactic cen- 
tre, the RAVE fields extend much closer to the Galactic plane, 
and in such fields giants will be particularly prominent. 

It is interesting to compare the variation over the sky of the 
giant/dwarf rat io with that predict ed by the Galaxy modelling 
code Galaxia (Sha rma et alJ 1201 lh for a survey with RAVE's 
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Fig. 6. Comparison of our results with those of IZwitter et alj 

d2010l) . Top panel: all 215 167 stars with two distances; lower Fig. 8. The distribution of derived absolute magnitudes in slices 
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the points. of the entire sample. 



selection function. Fig. [10] shows the observed giant/dwarf ra- 
tio, while the bottom panel of Fig. QT| shows the prediction of 
Galaxia for this figure. The top two panels of Fig. QT| show that 
the predictions of Galaxia depend significantly on the model of 
extinction by dust (which is that of iBland-Hawthorn et al.ll2010l) 
by showing the giant/dwarf ratio predicted with (middle panel) 
and without (top panel) including extinction. The middle and 
bottom panels of Fig. [TTIshow the impact on the predicted gi- 
ant/dwarf ratio of RAVE's handling of low-latitude regions, es- 
pecially the imposition of the colour condition J - K > 0.5 at 
225° < I < 315°. In both Fig. [10]and the bottom panel of Fig. \U\ 
the giant/dwarf ratio is above unity only within ~ 25° of the 
plane, but near the plane it achieves very large values - in Fig.PTOl 
there are cells with more than 10 dwarfs and n g /«d > 54 - both 
on account of the length of sight lines through the disc, and on 
the imposition of the colour cut J - K > 0.5, which enhances 
the giant fraction. Consequently, the dark blue region of FigfTOl 
is heavily saturated. The numbers in the top left corner of each 
panel give the ratio n g /«d of the total number of giants to dwarfs, 
which is 1 .05 for the data and 1 .39 for the model. Given residual 
uncertainty as to what RAVE's selection function is at low lat- 
itudes, we consider the agreement between the values of njn& 
from the data and Galaxia to be satisfactory. 



7. Parameter distributions 

What does RAVE tell us about the variation from point to point 
in the Galaxy in the distribution of stars over age and metallic - 
ity? We must bear in mind that our prior has a bigger impact 



on these stellar parameters than on distances. Consequently, we 
investigate the effect of our prior on the results. 



7. 1 . Region probed by survey 

Fig. [12] shows the density of observed stars in the (R,z) plane. 
The density seen here is the product of three factors: (i) the in- 
trinsic density of stars in the Galaxy, (ii) variation with distance 
from the Sun that follows from the survey's faint and bright mag- 
nitude limits and the stellar luminosity function, and (iii) a bias 
against objects in the plane that is driven by a combination of 
obscuration and the survey's avoidance of low-latitude fields. 
Notwithstanding the strong impact of the biases (ii) and (iii), 
the basic structure of the Galactic disc is evident in Fig. Q~2] 

Regardless of the extent to which the density of observed 
stars reflects the Galaxy's intrinsic structure or selection effects, 
it tells us for which regions the survey carries useful informa- 
tion. At the solar radius this extends to ~ 3 kpc above and be- 
low the plane, and beyond the solar circle there is a steady but 
gradual narrowing in the width in z of the surveyed region with 
increasing R. Towards the centre the surveyed region terminates 
at larger values of \z\ than towards the anticentre. Fig. [T31 shows 
the distribution in distance of stars in three ranges of Galactic 
latitude: \b\ < 40°, 40° < \b\ < 60° and \b\ > 60°. The median 
distances in these three classes are 1.03 kpc for \b\ < 40°, 450 pc 
for 40° < \b\ < 60° and 372 pc for \b\ > 60°. 
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Fig. 9. The distribution of derived absolute magnitudes in ranges 
of Galactic longitude (I). The same vertical scale is used for 
all three panels so one gets an impression of the longitude- 
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Fig. 10. The variation across the sky of the giant/dwarf fraction, 
with giants defined by (Mj) < 1 and dwarfs by (Mj) > 1. White 
regions are not sampled by RAVE. 



7.2. Metallicity 

We now look at the variation of mean metallicity with distance 
from the Galactic plane. We have two different sets of metallici- 
ties to work with, since each star has both an observed value and 
that returned by the model fitting. We plot both sets of data si- 
multaneously in Fig. [14] Since the model isochrones only cover 
metallicities down to [M/H] = -0.914, distributions of output 
metallicities for \z\ > 1 kpc show a tendency for stars to pile up 
near this limit. At lower heights the output distributions are dis- 
tinctly tighter than those observed, and below ~ 300 pc they are 
displaced to slightly lower [M/H]. The output histograms tend to 
negligible values well ahead of the metallicity of the most metal- 
rich isochrone ([M/H] = 0.54), suggesting that there really are 
extremely few stars with [M/H] t > 0.2. The median value of 
the observational er ror for each slice (e stimated via eq.[3]in this 
paper and eq. 22 of IZwitter et al.l l2008) is shown as a red scale 
bar on each panel, and, given that there are smaller errors on our 
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Fig. 1 1. The predict ion of the Galaxy modelling code Galaxia 
dSharma et al.l 120 1 1 ) for the structure of Fig. [TU] The top two 
panels show all stars with 9 < I < 12, without (top) and with 
(middle) including Galactic extinction. In the bottom panel low- 
latitude fields have been excluded and for < 225° < / < 315° 
only stars with J - K > 0.5 are included. 
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Fig. 12. The distribution of observed stars in the (R,z) plane. 
Contours are logarithmically spaced and represent densities in 
logfstars kpc -2 ). 



output metallicities, it can be seen that, apart from the bias to 
low metallicity just discussed, the scale of these errors is very 
reasonable to explain the difference between the blue and red 
distributions in each case. 

The progression that would be expected in metallicity is 
clearly visible as one moves away from the plane: from a nar- 
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Fig. 13. Histograms of the distribution in distance of RAVE stars 
in three ranges of Galactic latitude. 



row thin-disc distribution at low \z\, there is a gradual shift to 
a broader thick-disc distribution beyond around one thin-disc 
scale height, ~ 0.3 kpc, moving towards a significantly lower- 
metallicity halo distribution as one moves beyond the thick disc 
scale height, 0.9 kpc. Unfortunately, the isochrones we have used 
are all too metal-rich for halo stars, so in Fig.[14]the small num- 
ber of halo stars in the RAVE sample cause an un-physical peak 
in the blue points at [M/H] = -0.9. The metallicity distribution 
for 1 k pc < |z| < 1 .8 kpc is similar to that found bv lBensbv et al.l 
(2007) for thick-disc stars except that ours extends ~ 0. 1 dex less 
far on the metal-poo r side. 

From SDSS data llvezic et al.l (120081) concluded that the me- 
dian metallicity in the disc decreases from [M/H] t = -0.6 at 
\z\ - 500 pc to -0.8 beyond several kiloparsecs. Fig. [T4limplies 
that at \z\ = 500 pc the median disc metallicity is > -0.1, and 
even at \z\ > 2 kpc it is no smaller than -0.5. 

Close to the plane the natural comparison is with the metal- 
licity distribution within the Geneva-Copenhagen survey (GCS) 
of H ipparcos stars dNordstrom et al.l l2004t iHolmberg et al.l 
2009). Unfortunately, the metallicities of these stars are some- 
what controversial. iFuhrmannl d2008l) finds that high-resolution 
spectroscopy of a sample of only 185 local F and K stars im- 
plies that the solar -neighbourhood distribution in [Mg/H] covers 
(-0.2, 0.2), while iHavwoodl (120061) argues that the metallicity 
distribution of young stars is intrinsically narrow and the spread 
in measured values of ~ 0.1 dex is dominated by measurement 
error. The bottom panel of Fig. [14] suggests that near the plane, 
the intrinsic spread in the metallicity is indeed narrow. 

The means of the distributions of input and output metallici- 
ties shown in Fig.[14]lie close to one another at all values of \z\. 
Fig. [I5]makes this fact clear by showing the output distribution 
in blue superimposed on the wider input distribution, plotted in 
red. The similarities of these means implies that the blue output 
metallicities are not merely reproducing our prior. 

7.3. Stellar ages 

Stellar ages are extremely hard to determine reliably. The most 
reliable ages are those obtained from the location in the (r e ff , L) 
of slightly evolved main-sequence stars with independently de- 
termined distances. Since we do not know the distances to stars a 
priori, our ages must be suspect. They are nonetheless of interest 
as sanity checks on the performance of the algorithm. They may 
even be serve as indicators of possible trends in the data, since 
even in the absence of an independent distance estimate for a 
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Fig. 14. The distribution in metallicity at several distance from 
the Galactic plane. Red points show observed metallicities, blue 
points show the output from our analysis. Values of \z\ are in kpc, 
and the median output metallicity for each slice is displayed. The 
red scale bar in each panel represents the median observational 
error for that subsample. 



input 
output 



\ 

-0.5 




z/kpc 

Fig. 15. The variation in the metallicity distribution as one moves 
away from the plane. Red: observed metallicities, blue: output 
metallicities. The solid line represents the median of each distri- 
bution and dashed lines show l<x variations from the median. 



star, the spectrum combined with the star's photometry does pro- 
vide some indication of its age. 

Fig-HSJdisplays a colour-scale plot of the average stellar age 
across the (R,z) plane. To some extent this distribution reflects 
our prior, but it is encouraging to see that the map conforms to 
our intuition: at small \z\ a young disc is dominant. This young 
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Fig. 16. The distribution of average stellar age in the (R, z) plane. 
The colour scale gives ages in Gyr. Regions for which there is 
inadequate data are white. 
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Fig. 17. As Fig. [16] but when the data are analysed with a prior 
that is completely fiat in age. 



structure dwindles as one moves outwards in R and away from 
z — 0. Far from the plane, the gradient in age becomes small, 
consistent with these regions being dominated by an old popula- 
tion that is not strongly concentrated to the plane. 

Note that shapes of the contours in Fig. [12] and Fig. [16] are 
quite different. This fact is reassuring, for it tells us that the mea- 
sured age distribution remains stable even when the survey picks 
up only a small fraction of the Galaxy's stars. 

Fig. [T7] addresses the fear that the age gradient evident in 
Fig. [T6]is an artifact produced by our chosen prior, by showing 
the distribution one obtains with a prior that is completely fiat in 
age from the present back to age 13.7 Gyr; all other elements of 
the prior (metallicity, number density, IMF) were unchanged. We 
see that dropping the prior causes the mean age for many regions 
to become ~ 6 Gyr. This happens because with a fiat age prior, 
the age pdfs of stars become broad and their means shift towards 
the centre of the permitted range. Although age is less strongly 
correlated with \z\ in Fig.[T7]than in Fig.[l6l the youngest stars 




age/yr 

Fig. 18. The distribution of stars in the age-metallicity plane at 
\z\ > 500 pc (above) and \z\ < 500 pc (below). The colour en- 
codes the base- 10 logarithm of stellar density, with red indicat- 
ing a complete absence of stars. The values of [M/H] used are 
the outputs of the distance-finding algorithm. 



remain concentrated to the plane, and above the plane there is 
a tendency for mean age to increase with R at fixed z as we ex- 
pect if a tapering young disc is superimposed on a broader old 
population of thick-disc and halo stars. Thus although the prior 
is having a significant effect on the age distribution we recover 
from RAVE, it is not entirely responsible for the nice age distri- 
bution seen in Fig.fTSI 

In Fig. [16] the immediate vicinity of the Sun seems to have 
a slightly older population than points just above the plane and 
~ 1 kpc further in or out. The number of stars seen near the plane 
and ~ 1 kpc from the Sun is small (Fig.[T2l and the stars we do 
see have a high probability of being obscured by dust. The ob- 
scuration will select against low-luminosity stars and thus favour 
the entry into the catalogue of hot young stars. It will also affect 
age determinations, but in an unpredictable way because T e g 
will be changed as well as the broad-band colours. Moreover, 
in low-latitude fields unusual objects were deliberately targeted 
by the RAVE survey. Consequently, the data for low-latitude re- 
gions that lie ~ 1 kpc from the Sun are suspect, and the more 
gradual falloff in age with height near the Sun is likely to be 
more representative of the disc than the steeper gradient seen 
further away. A countervailing consideration is that the bright- 
magnitude limit of RAVE will exclude nearby young stars and 
thus bias the nearby data towards older stars. Reassuringly, when 
Galaxia is used to simulate Fig. [16] a similar distribution of ages 
is produced. In particular, within the plane the mean age of stars 
decreases with distance from the Sun for s ^ 1 kpc. 
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7.4. Age-metallicity relation 

Fig. [18] shows the distribution of stars in the age-metallicity 
plane at \z\ > 500 pc in the upper panel and \z\ < 500 pc in 
the lower panel. At high |z| we see the expected concentration 
of thick-disc stars to high ages with a broad metallicity distri- 
bution. The ridge line of the population clearly shows a rapid 
increase in metallicity with time, to solar metallicity at an age 
of 6 Gyr. Nearer the plane, there is a significant population of 
young stars and a lower envelope to the distribution that allows 
for relatively young stars that have distinctly sub-solar metallic - 
ities. The highest density of thin-disc stars occurs at [M/H] t =* 
and age ~ 6 Gyr, distinctly younger than the maximum permit- 
ted age of disc stars. And at [M/H] t =* the density of stars tails 
off rapidly at ages in excess of 6 Gyr. At earlier times only metal- 
poor stars formed, and the rate at which they did so appears to 
have been fiat or even increasing with time, while at later times 
the star-formation rate must have declined steadily with time. 

While these trends are interesting and consistent with a plau- 
sible scenario for the chemical evolution of the Galaxy, they 
should not be assigned much weight for the reasons given at 
the start of 87.31 It is also worth noting that the plots in Fig. [18] 
differ in important respects from equivalent plots produced by 
Galaxici. In particular, the synthetic plots for \Z\ > 500 pc do 
not show a region of high density at (t = 6 Gyr, [M/H] t > 0), 
and, with the expected errors in log t, the distribution of ages at 
[M/H] < -0.4 is significantly wider than in Fig. [18] 



8. Conclusions 

We have derived distances (or parallaxes) to ~ 216 000 
stars in the RAVE sur vey using the Bayesian analysis of 
Burnett & Binnev (2010). We have checked the parallaxes and 
their associated errors against Hipparcos parallaxes, and con- 
clude that for dwarfs cooler than T e ff = 6000 K the parallaxes 
are unbiased, but the parallaxes of hotter dwarfs are systemati- 
cally too large by ~ 0. lcr because even the VDR2 pipeline over- 
estimates the gravities of hot dwarfs. The parallaxes of giants 
tend to be too small by ~ O.lcr. For all three classes of star, hot 
dwarfs, cool dwarfs and giants, the scatter in differences between 
the spectrophotometric and Hipparcos parallaxes is consistent 
with the formal errors on the parallaxes. 

We have checked our distances and our errors against dis- 
tances to star clusters, which tend to be beyond the reach of 
Hipparcos. This rather small sample of stars, which contains 
both dwarfs and giants, is consistent with our distances being 
unbiased and their errors being accurate. 

RAVE has obtained more than one spectrum for ~ 19 000 
stars. We have used this sample to assess our errors by compar- 
ing independent distances to the same object. The scatter in the 
difference of distances is only half the quadrature-sum of their 
formal errors. This is consistent with a significant contribution 
to the errors coming from factors, such as defects in the spectral 
analysis and the physics of the stellar models, that are the same 
for repeated determinations of the distance to a given star. 

Comparison of our distances to the spectrophotometric dis- 
tances of Zwi tter et al.l (2010) shows that we assign slightly 
larger distances to dwarfs and smaller distances to giants than do 
Zwitter et al. Given the signs of our deviations from Hipparcos 
parallaxes, it follows that for both dwarfs and giants our dis- 
tances are more accurate than those of Zwitter et al. For dwarfs 
this finding can be traced to the use by Zwitter et al. of the VDR3 
pipeline, which tends to over-estimate log g, especially for hot 
dwarfs. Our distances to giants benefit from a more sophisticated 



prior, which tends to pull stars to smaller distances. For dwarfs 
the distribution of normalised residuals between our distances 
and those of Zwitter et al. is rather narrow, having a dispersion 
of only 0.52, implying that much of the uncertainty in distance 
arises from errors in the original data and the spectral template 
library, which are common to the two studies. 

Our formal errors are based on the cons ervative estimates of 
the errors in the input data given bv lZwitter et a l. (2008). The 
conservative nature of these errors is confirmed b y the analyses 
of both repeat observations and the distances of IZwitter et al.l 
(2010). However, the analysis of Hipparcos stars indicates that 
by happy chance our formal error budget is just large enough to 
encompass external sources of error, such as spectral mis-match 
and deficiencies in the stellar models, so our formal errors are 
close to the final uncertainties in our distances. 

We have examined the distributions of the errors returned 
by the Bayesian analysis for distance, metallicity, age and ini- 
tial mass. The median formal distance error is 28%, the median 
formal uncertainty in [M/H] is 0.17dex, the median formal un- 
certainty in log(r) is 0.27 dex and the median fractional uncer- 
tainty in initial mass is 16%. These figures show that by using 
prior knowledge of the structure of our Galaxy and the nature 
of stellar evolution, one can constrain stellar parameters more 
narrowly than when each spectrum is considered in isolation. 

Data gathered during the pilot par t of the RAVE survey has 
now been released (Sie bert et al.ll201 11) . and the distances we de- 
rive from these data are contained in the release. In view of our 
results for the Hipparcos stars, it may be useful to correct the 
distances of dwarfs with T > 6000 K by increasing their dis- 
tances by 10% of their formal errors, and to correct the distances 
of giants by decreasing their distances by the same amount. 

Our distances reveal which parts of the Galaxy the RAVE 
survey probes. Roughly half the stars in the RAVE catalogue are 
giants and half dwarfs. The giant/dwarf ratio varies strongly with 
Galactic latitude and to a weaker extent with Galactic longitude. 
The structure of the variation is accurately predicted by Galaxia 
when obscuration by dust is included, which significantly re- 
duces the fraction of giants seen towards the Galactic Centre. 
Although the spatial distribution of RAVE stars reflects the sur- 
vey's selection function as well as the intrinsic stellar density of 
the Galaxy, it nonetheless reveals the double-exponential struc- 
ture of the disc. Moreover, the distribution of stellar ages shows 
the expected concentration of young stars towards the plane. 

The metallicity distribution evolves systematically with dis- 
tance from the plane, being remarkably narrow and slightly sub- 
solar at \z\ < 150 pc, to much broader and centred on [M/H] =* 
-0.5 more than 2 kpc up. Our results support the view that obser- 
vational errors have the biggest impact on the observed metallic- 
ity distribution near the plane. 

This work has brought into sharp focus the crucial impor- 
tance of the pipeline that extracts stellar parameters from the raw 
spectra. A valuable upgrade to the current pipeline would be to 
force the parameters assigned to a star to be consistent with mod- 
els of stellar evolution. This upgrade, which is mandatory from 
the perspective of Bayesian inference, would eliminate the high 
density of stars in the bottom-right panel of Fig. [2] at logg =* 5 
and r e ff > 6000 K. This study also highlights the need for a more 
sophisticated analysis of the errors in the parameters returned by 
the pipeline. For example, one would expect the error on logg 
to depend on [M/H] in addition to S/N, which we have had to 
assume has complete control of the errors in the input parame- 
ters. Moreover, near the turnoff the errors in T^ and logg will 
be quite strongly correlated, and we have had to neglect this fact. 
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All improvements in the pipeline will feed through into more ac- 
curate distances. 

The distances and stellar parameters described here provide a 
basis for extensive work on the structure, kinematics and dynam- 
ics of our Galaxy. Work on the Galaxy's kinematics is already 
underway, and papers in this area will appear shortly. Work on 
the Galaxy's dynamics requires characterisation of the intrinsic 
density distribution of the population(s) whose kinematics have 
been measured. Determining those densities involves determina- 
tion of the survey's selection function. This is the next major task 
that must be accomplished before the RAVE survey can attain its 
ultimate goals. 
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